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Abstract 


Population models for dependence between two angular measurements 
and for dependence between an angular and a linear observation are proposed. 
The method of canonical correlations first leads to new population and sample 
measures of dependence in this latter situation. An example relating wind 
direction to the level of a pollutant is given. Next, applied to pairs of angular 
measurements, the method yields previously proposed sample measures in some 
special cases and a new sample measure in general. 
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1, Introduction 


This investigation centers on the problem of correlation, or dependence, 
for circular random variables. Although several nonpara metric sample measures 
of dependence have already been proposed for angular observations, there 
seems to be no literature that treats models for correlation between a circular 
random variable and a linear random variable. Here we introduce a measure of 
dependence between circular and linear observations and a similar measure for 
dependence between two sets of angular observations based on the method of 
canonical correlation. The asymptotic distribution of the measures is discussed. 
Some population models are introduced to illustrate the proposed methods. 

These are among the first population models for random vectors taking values 
on a cylinder or on a torus and should prove useful in future studies dealing 
with other population correlation measures. 


2. Canonical Correlation Applied to a Random Variable on the Circle and a 
Random Variable on the Line 

An interesting problem is determining a measure of dependence between 
9, a random variable taking values on the unit circle, and X, a random 
variable taking values on the line. We introduce YJ - (cos 0, sin 6) to 
represent 0 as a unit vector. We now wish to determine ,a, such that Y, 
and X have maximum correlation. 

We define the covariance matrix of (Y* X)' by 


S = 


1 


var {cos 0) 


cov (cos 0, sin 0) , 

cov (cos 0, X) \ 

1 s iii s i 2 ; 

cov (cos 0, 

sin 0) 

var (sin 0) ! 

cov (sin £, X) } = 

i , : 2 ! (1) 

cov (cos 0, 

'xj " 

cov (sin 6, X) 

var (X) / 

\ ^12 ,tF 
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Let ^ be a fixed vector and b be a constant. After imposing the conditions 

2 2 

that 4 ' £^4 = I, b <r = l, the maximum correlation is given by the largest 
solution of the determinantal equation (c. f. Anderson ( 1 958), Chapter 12) 


- X. S 


11 


'21 


'12 


- \<r c 


( 2 ) 


Since the correlation p(a,' £, X) is scale invariant, we can , instead, uniquely 
determine 4 and b by imposing the restrictions that 4' 4 = 1 and b 2 = 1 . 

The maximum correlation subject to these constraints is the same as that 
obtained by using the usual constraints. We are thus finding the maximum 
p [cos (0- <*), X] over all angles a. The angular -linear correlation p^ * s thus 
defined by 


P A ^ = max p[cos(0-a), X ] 


(3) 


In order to make statistical inferences concerning Ppjj we may either 
use results related to a specific model such as maximum likelihood estimation 


theory or else large sample approximations based on the estimated covariance 
matrix . As is usual with applications of the canonical correlation method as a 
descriptive measure, one does not need to assume any specific population 
form but only use the sample covariance matrix given below by (4). The large 
sample approximations will enable us to determine confidence bounds for p^ 
as well as test for independence. 

We consider a random sample (0.,X.) where 0. is taken from a distri- 
bution on the circle and X. is taken from a distribution on the line. Consider 

1 

the representation Y . = (cos 0. , sin O.V, i = 1 n . 


t 
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- , n 

Let C = — £ cos 0., 

n i=l 1 


- I n 
S = „ S sin 0,, 

n i=l 1 


— i n 

X =- £ X., The sample 
n i=l 1 


covariance matrix for JZ j = ! Xj)* , 1 = 1, . . . , n, is given by 


I ± £ (cos e t - C)‘ 


s = 

n 


~ 2 (cos 0 - C)(sin 9 l -S) 


— £ {cos G - C)( sin 0^ - S) 


£ £(sinG l - S) 2 


\I 


£ (cos G^C) (X. - X) 


Jr(sin0j- SMX.-X) 


— £ (cos 0 -C)(X.- X ) \ 
n i i ' 

! 

^£(sin G i - SKXj-X) | 

/ 


~£(X t - X) 2 


11 

S 12 

i 

i 

S 13\ 

\ 

! s \ 

! 11 | 

s .A 

i 

21 

S 22 j 

S 23 

= 1 I- 

31 

1 

S 32 ! 

I 

S 3 3 / 

s 2l | 

S 22 ! 


(4) 


Then the sample angular-linear correlation is the largest 

solution to 

i /V 

s. 


/V 

- \ s 


11 


12 


21 


-h S 


22 


= 0 . 


(5) 


We wish to determine the asymptotic distribution of r^ . 

Now ^ S „= -t S 
v n i=i 

— i n — 

where JZ = — £ Z. , m- = E Z, . The first term of vn S is asymptotically 
n ~ ~i n 

normal according to the central limit theorem, and the last term goes to zero by 

the weak law of large numbers. Consequently \frT (S n - £) converges to a normal 

distribution with mean ^0 . Set = (T^, T^, . T^)' equal to ( s 22 s^s , ^ ' 
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and or, = ^n; 0- 2 2^33^12^13^2 3^' * By lookin 9 at the appropriate entries of S 
and S, we see VrT (£ n -£) - N(0, B) where B = E(U U') -£&' an d 
JJ = [(cos 0^ - E cos 0j) 2 , (sin 0^ - Esin0j) 2 , (X^ - EX^) 2 , 

(cos 0^ - E cos ( sin 0^ - E sin 0j), (cos 0^ - E cos Gj) (X^-EXj), 

(sin0j - E sinO^tXj- EXj) ] . 


*2 


Writing (5) in terms of T, n and solving for X 


r^ T , we obtain 


‘AL 


T|I 6 2 4 T 2 T 5 2 - 2T 4 T 5 I 6 
T 3< T 1 T 2- T 4» 


( 6 ) 


2 

Next we define g(£ n ) = r A £ wbere r A L is defined b V 


( 6 ). 


Then letting £ = 


9gCL> 

3L 


we have (g(T, n ) - p a ^) ** N(0,^'B^) 


(c. f. Anderson (1958), p. 76) and by taking the square root transformation, 
we have V n (^g(X n ) " P AL ) n ^°’Pal & * We tbus obtain 


f[_ typAL 1 

v 2 (TJ 


N ( 0, 1) 


where v[T ) = v(£) | = *' B^,p 2 | (c.f. C.R. Rao (1973), 

S=S 2=S 

n n 

p 387 ). Based on the asymptotic normality, a I00(l-a)% confidence interval 
is given by 


r V ^n >«„/» < 

AL ' sHT 


p <r 4 
P AL AL 


Vn 


where z is the upper ~ point of a standard normal distribution. 
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An Example 

The wind direction and ozone concentration were observed at a weather 
station in Milwaukee at 4 day intervals from April 18 to June 29, 1979, at 6 a. m. 


Wind direction 


in degrees 

327, 

90. 9, 

88. 2, 

Ozone 

Concentration 

28. 0, 

85. 2, 

80. 5, 

Wind direction 
in degrees 

281, 

8. 04, 

204, 

Ozone 

concentration 

31.5, 

112, 

20. 0, 

Wind direction 
in degrees 

11.5, 

27. 0. 

84. 4. 

Ozone 

Concentration 

52. 6, 

91. 8, 

55. 2. 


305, 

344, 

27 0, 

66.8, 

20. 5, 

4. 66, 

45. 9, 

12.7, 

72. 5, 

56. 6, 

86. 4, 

33 n 

18. 1. 

56.7, 

6. 03, 

72. 5, 

16 . 0 , 

45. 0, 

32. 6, 

56. 6, 


The correlation between X 

* 49 < P AL < ‘ 96 and 95 
maximum r (cos (0-a-), X) = 


and 0 is r AT = .7 2 with a 7 5% confidence bound 
bound . 32 < p^ L < 1. 00. The angle of 

r AL 1574 ■ 
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3. A Population Model for Line and the Circle 

We now consider (0,X) having the partly wrapped bivariate normal 
distribution as described in Appendix 3. We let Y, = {cos 0 sin 0) . By 
using the moments (A. 12) and elementary trigonometric identities, the co- 


variance matrix (1) becomes 


2 2 
-<r. -o'. 


1 


2 2 

/ -<r, -<r, 2 

/ |(l-e ) (1-e ‘cos 2|ij) -|<l-e )e sin 2^ -e PV 2 sin ^1 

/ 


2 2 
- 0 T -<r. 


2 rr 2 

-«r. -cr. 


Ss/ -|<l-e *)e 1 sin 2p, |<i-e l )U+e 1 cos 2pj) e 2 po^ cos Pj 


1 

T 


\ 


-e p ‘ r i tr 2 s * n 


e P <r i (r 2 cos 1A i 


The deterininantal equation (2) reduces to 

-<r 2 

M-a 

2 


\ 7 \ 2 _2< i 2 "*? 2 

> " e |,,r i) 


= 0 


This has roots X = 0, £ 


p(r i 


J. 


. The maximum root is 


pi <r 


1 


sinh <r 


1 


j 2 

vsinh tr^ 


Consideration of p[X, cos(0- a)], where a is a constant, leads to 

2 


p[X, cos(9-cf)] = 


-\l 2 sin (p^-a) e 


x. 

2 


i c £ 

s/ (1-e 1 ) (1 - e cos 2(Pj-e) ) 


kL 

For this to equal p al , we need both cos 2 (pj-g) = -1 and sin (Pj-») = - p • 
These are satisfied by a = \i^ + ~^ if p >0 and a = p^ - ^ if p < 0. We thus 


«’ 
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obtain max p [cos(G-tf),X] = p [cos(0- p - j) t X] - p[sin (G-p.), X J if p> 0 
a I 1 c 1 

| P [cos(G-Pj + -|),X ] = -p [sin ( G-jjlj), Xj if p < 0 

The maximum correlation is obtained by centering the G variable and then rotating 
by an additional tt/2 . 

4. Some Previous Measures for Dependence of Angular Observations 

The existing literature deals exclusively with nonparametric sample 
measures of dependence. Epp , Tukey, and Watson (1971) proposed a permu- 
tation test for pairs of directional observations . Let QC^, Y^) be 

a sample of pairs of unit vectors. Then they suggest that a suitable test 

n ■ n 

statistic is L = S X, Y. , or equivalently, 2 cos G. where 0 is the angle 
i=t ^ ^ i — 1 1 

between & j and .. Approximations for the permutation distribution are given for 

both statistics. 

Downs, Liebman, and McKay (1967), Downs (1974), and Stephens ( 1^73 ) 
develop methods of measuring the rotational correlation. Let . . . , Y^) 

be a sample of pairs of unit vectors. It is desired to determine the extent to 
which each^C . is a constant rotation of That is, whether each = HX 
where H is an orthogorBl matrix with determinant one. To do this, they find the 

A 

orthogonal matrix H which minimizes 

n 

f = 2 ( Y. - HX.)' (Y. - HX.) 

^ ■^'1 W'i 

A 

or, equivalently, that H which maximizes 

r* = 2 (HX)Y. (7) 

i=l ^ 


where the minimum and maximum are taken over all orthogonal matrices H. 
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Both Downs (1967) and Stephens (197 3) give a method for obtaining H. 

The first authors propose a rotational correlation coefficient which is 
analogous to the usual sample correlation coefficient and is defined by 


n -\ "v a 

-ocrX> »<£,-.!<,) 


Hi 


r = 


i=l 


A A 1 

i 


( 8 ) 


(X-'X)] 

1=1 1 1 i=l ^ 1 

where X and X are unit vectors having the same direction as resultant 
vectors of the sets (XJ and IX 

n a \ 

respectively. Stephens proposes the measures r = £ (HX<) X* where H 

i=l n 
n 

« ( 

is the orthogonal matrix which rr-ximizes (7) and r + = - (H X[) X^s where 

i=l 

H* is the orthogonal matrix with determinant one which maximizes (7). 

Rothman (1971) adapts a test for independence based on the empirical 
cumulative distribution function to a test of coordinate independence for a 
sample on the torus. Mardia (197 5) defines a correlation coefficient for 
circular data based on the ranks of the observations. Rao and Puri (197 j) 
propose a test for coordinate independence based on the number of observations 
falling in half -circles. They derive the asymptotic distribution of the test 
statistic and a computational form of this test in terms of the X and Y spacings 

5. Canonical Correlations Applied to Bivariate Circular Data. 

Let 0 L and 9^ be two random variables which take values on the unit 
circle. One may consider the representation 


X, = 
~l 


f cos G^ \ 

i 

sin 0, i 
1 / 


» 


cos G 


2 ; 


\ sin 


(°) 


-io- 


This provides a 1-1 correspondence between each angle in the interval 
[ 0 , 2 tt ) and the set of unit vectors . Based on the representation, the 
objective of the cannonical correlation method is to find a, and Jd such 
that ja' ^ and Jj* X 2 have maximum correlation. 

We let X' - i X 2 ) and denote the covariance matrix of £ by 


Var (cos 0j) 

Cov(sin 0^ cos G^> 
Cov(cos 0 2 , cos 0j) 
Cov(sln0 2 , cosGj) 


Cov (cos 0p sin 0^) 

VartsinOp 
Cov(cos 9 2 , sin 0j) 
Cov(sin0 2 , sinO^) 



l 

l 

I 

l 

z; 

21 

s , 

» 

j 

2 

21 

i 

22 


» Cov (cos Gp cos 0 2 ) 

} Cov( sin Op cos B^) 
Varieos 0 2 ) 

Cov ( sin G^, cos 0,) 

I 

f 

f 


Cov (cos Gp sin 0 2 ) \ 

Cov(sin0p sin 0,,) 
Cov (cos 0 2 , sin 0 2 ) 
Var(sin 0^) 

(10) 


Further, let ^ and Jd be constant vectors. We wish to find ^ and £ 
which maximize the correlation between ^ and >(, 2 . Generally the 
maximizing combination is made unique by imposing the conditions that 

ja = 1 and Jd' JQ, - 1* The maximum correlation is then given by the 
largest root of the determinantai equation, 


- \ 2, 


il 


12 


"21 


L. 


22 


(ID 


Since the correlation coefficient is scale invariant, we can instead 
impose the restrictions on and b, that & = b/ b - l. In this case. 
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a, and J? can be represented by 

f cos a \ cos (3 

A ~ l 1 , .fe * • ( 12 ) 

sin « sin fi 

Moreover, = cos (0^- a). Js,’ 2 = cos (°2~ P)* and t ^ ie P r °bl em becomes 

that of maximizing the correlation between cos (0,-u) and cos(0 2 -{3) over all 
o, (3 c [o, 2ir). Consequently, we define the angular canonical correlation between 
0^ and by 

P A - sup p [cos ( 0j — a ), cos(0 2 -p)]. (13) 

a,$c [o, Ztr) 

This measure is obviously invariant under rotations of 0^ and 0 2 . 

\> nil Vi; the sample measures (7) and (8) discussed above, it is, 
in general, necessary to rotate both coordinates in order to obtain the maximum 
correlation. The necessity of the two rotations will be shown for the bivariate 
wrapped normal model of Section 7. 2 and for the example of section 6. In 
section 7. 1, one rotation will be seen to be sufficient to obtain maximum 
correlation for certain models with uniform marginals. 

One important practical feature of using the method of canonical correlation 
to measure angular correlation is that it enables one to find the angular 
correlation by using standard statistical programs for canonical correlations. 

The angles a and 3 which give the maximum correlation can easily be found 
from such programs by converting the coefficients for a_ and b to unit vectors 
and then finding the o and (3 which satisfy (12). 

We also consider the following heuristic justification for 
the above method for determining angular correlation. Suppose one wishes to 
find the functions f(0^) and g(© 2 ) which have the greatest correlation. 
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The Fourier expansions of f(0j) and g(0 2 ) have the form 


OO 


oC 


f(Oj) - £ (a n cos n0^+ b n sin nG^), g(0 2 ) = s (c n cos n0 2 + d n sin n0 2 ). 

n=o n=o 

Ignoring second order and higher terms, we obtain 
f{6j) = a Q + a^ cos Oj + sin 6^ = a Q + Aj cos (0^-a) and 
g(0 2 ) = c D + c^ cos 0^ + dj sin 0^ - c o 4- A 2 cos (0 2 - p). Then 

max p[f(9|S g(0 2 ) ] == max p[a Q + Aj cos (Oj-f), C Q + A 2 cos (0 2 “P) ] 

a o' A i* ff ’ 

C o’ A 2’ ^ 


- max p [cos(0j-&), cos(0 2 -p)] - p A> 

«, P " 


We note that our proposed measure of correlation can be considered as a 
measure of rotational correlation. The following lemma shows that perfect 
correlation occurs if and only if one angle is a constant rotation of the other 
angle. 

Lemma 5. 1 If 0^ and 0 2 are circular random variables whose distributions 
have support [o. 2tr), then p a = 1 ^ 0-, = 6^ + 6 (mod 2tt), with probability 
one , for some 6. 


Proof First assume that p A = 1. Then there are some a-, P such that 
p[cos (0j-a), cos (0 2 ~P)] = 1, and by the Cauchy - Schwartz inequality, 
cos (0 2 -p) = A cos + B with probability one for some constants A and B. 

Since the support includes (0j, © 2 ) with G^and © 2 both taking all values in (o, 2 tt), 
the requirement that cos ( G^— (3) vary from -1 to -fl leads to B = 0, A = +. 1. 

For A = l, cos (0 2 ~p) = cos and 0 2 - P = 6 1 * G ^ moc ^ 2ir). We take 

f' = P - a (mod 2ir). For A = -1 , cos (0 2 - P) = -cos (0j-o) = cos (0j- a- tr) , 
and we take 6 = (3 - a - tt (mod 2tt). 
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If 0 2 = 0 l +6 ( mod 2 *0 with probability one, then 

1 > p A = max p[cos (6,- a), cos (9 2 -p) ] = max p[cos(6^-a), cosi©^ 6-p) ] 

Q,0 0,P 1 

> p [cos 0J,COS 0^] = 1, 


Remark 

We also note that a linear relationship between 0j and 0 2 does not 
imply perfect correlation. In fact we can have = 20^ (mod 2 tt) and p a = 0. 

Let 0. have the uniform distribution on [0, 2tt) and © 2 - 20^ (mod 2^). Then 
it is easy to check that cov [cos(e.-a), cos(20j-p)l ~ 0 for all g, P and 
hence, p a = 0. However, independence clearly implies p a = 0. 

6. Inference from Sample Angular Correlations 

In this section we outline a method cf obtaining the large sample 
distribution of the angular correlation coefficient, r A . This will provide 
an asymptotic method of finding confidence intervals for r A when the under- 
lying distribution is unknown. In the case wnere the family of underlying 
distributions is known, some statistic based on these distributions, such as the 
maximum likelihood estimate of p A , should be used. 

We . insider a random sample ( 0^, r|,), i = 1 n, from a bivariate 

circular distribution. Consider the representation 



' cos 0^ 
sin 0. 


*1 


'COS T}.' 
*i 

I 

! sin ri 


Z, 


V 

i 


i =1. 


n. (14) 


1 n 

Let = — S Zj be the sample mean vector. Then the sample covariance 

matrix is 
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S = - S (Z . - Z )(Z, - Z )' = 
n n ^ 'ti H ~n' 


1 


s il 

S 12 

S 13 

S 14 

i s n » s i2 \ 

1 

1 

S 21 

S 22 | S 23 

S 24 

\s 2 i| S 22 


S 31 

S 32 | 

S 33 

S 34 



S 4l 

S 42 1 

S 43 

s 44 


We now define 

~n = (S 11> S 22-> S 33' S 44' S 12 S 13< S 14' S 23 S 24> S 34 )! 


The determinantal equation (11) with S.^. replacing becomes 

A 4 ^ j 

Cj r + c 2 ^ +c 3 = o 


where C^C 2 , C^can be viewed as functions of T . By expanding (11) 
we obtain 

C l = < T lV T 5 > <T 3 T 4- T 10> 

C 2 * ‘ Wl + 2 VsVlO ' T 1 T 3 T 9 * 2T 5 T 7 T 8 T 10 * “ 5 WlO 
4 2T 3 T 5 1 7 T, 4 2T 4 T 5 I 6 T 8 - T 2 T 4 T 6 2 4 VjT, 2 


C 3 * < T 6 T 9' T 7 T 8> 


The solution to (16) becomes \ 2 = - C^± si C 2 - 4CjC 3 ) / 2C^ 


Thus, r A - \ , the largest root of (11), is given by 
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K = 


/ -C 2 ^ ^C^-4C lC 3 
n/ 2C l 


( 18 ) 


Using the same proof as in section 2 above, N/n - 2) is 

asymptotically normal with mean zero and the asymptotic covariance of the 

tH th 

(jjk) 1 entry and the (l, my entry given by 


E[Uj- Y ^i>( z m Hrn) ] - 
where 

= cos0^, z 2 = sin 0., = cos r^, z 4 = sin r^, = Ez^ . 

^ & = ( ^l*22.W44^2^13' W23.' r 24.' r 34>'- Then ^ N(0 - B > 

where B is a lO x 10 matrix with entries corresponding to those given for the 

asymptotic distribution of S. Then n/ n (r A - p a ) -* N(0,^,' B 4>) where 


£ = 


9<3C^> j 

B T I 


T 

'"•■n 


=r O’ 


(19) 


One can straightforwardly determine ^ by using (17), (18), and (19), and thus 
obtain an asymptotic variance. To find confidence bounds, we let v (2) = ,£' B^, . 
^(P R - Pa) 


Then 


A r A' 

— j 

v 2 (S) 


N(0,l) where v(S) = v(2) . The above method was 


illustrated in more detail in section 2 dealing with the correlation between 
X and 0. 

Example 

The wind direction at 6 a. m. and 12 noon were measured each day at 
a weather station in Milwaukee fov 21 consecutive days. We wish to determine 
whether the two wind directions are correlated. 


i 


“ 16 - 


Wind Direction in Degrees 


6 a. m. 

356, 

97.2, 

211, 

232, 

343, 

292, 

157 

, 302, 335, 

302, 

324, 

84. 6, 

noon 

119, 

162, 

221, 

259, 

270, 

28. 8 

, 97. 

2, 292, 39. 6, 

313, 

94. 2, 

45 , 

6 a. m. 

324, 

340, 

157, 

238, 

254, 

146, 

232, 

122, 329 




noon 

47, 

108, 

221, 

270, 

119, 

248, 

270, 

45, 23.4 





The direction of the resuitarts are 286° at 6 a. m. and 33° at noon. We 
obtain = . 567 3, with .23 < < . 90 as a 95% confidence bound for p^. 

The angles a and p for maximum correlation are c = 26° and p = 58° . 

Suppose Instead we used a single rotation procedure as suggested 
by Downs et al. (1967). Then, if we fix ©2 (noon readings), the maximum 
correlation between cos (0^-a) and cos © 2 is . 4479 for a = -12°, This is 
substantially smaller than r A * 

7. Canonical Correlation Applied to Some Bivariate Models on the Torus 
Population models of bivariate circular random variables having de- 
pendence are indispensible for studying the various measures of correlation. To 
partially fill a noticeable void in the literature, we introduce the following models. 

7. 1 Models with Uniform Marginals 

In this section we will discuss the application of canonical correlation 
to models with uniform marginals. Two possible models are given, the second 
more general than the preceding model. 

Suppose (0j,, 0 2 ) follows a bivariate circular distribution whose density 
is given by 

i IC 008(6.-0,-11 ) 

f( 0 i’ 0 2 )= T 7 TW e 1 2 °, 0 <e lt 0 2 < 2., (20) 

o 
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where k > 0 and 0 < u < 2n are the parameters. This model can be obtained 

— o 

by finding the distribution on the circle which maximizes the entropy, 

2ir 2 it 

- J J f(0 lf 0 2 ) log f(0j, e 2 ) d0 1 d0 2 , subject to the conditions 
o o 

EfcosiGj-e^ ] = A cos \i Q , E[sin(Op 6 2 ) ] = A sin jx Q , 

where A and p Q are preassigned constants (c. f. Kagan, et al. (1973), p. 409). 
In this model, the marginal distributions of and 0 2 are both uniform while 
the difference of the angles, has a von Mises distribution. 

For this model, we now consider the representation given by (9), 
and let S be defined by (10). By straightforward integration, we obtain 


1 

2 


0 

i 

i 

i 

i 

» 

COS p 

2 o 

sln ^ o 

0 


1 

2 

i 

i 

i 

i 

. .. j.* 

«*» >\> 

2 "'^ COS P o 

A(k) 

cos u. 

A( k ) 

» 

i 

sin u J 

1 

0 

2 

o 

2 

° 1 

2 


A(kJ. 

2 

sin 

A(k1 

i 

t 

0 

1 

2 

COS U l 
O | 
1 

2 


where A(«) = I^(k)/I o (k) and I p ( K ) is the modified Bessel function of the 
first kind and of p th order. The determinantal equation (11) becomes 

( k 2 - A(k) 2 ) = 0 . 

Hence, the maximum correlation between a^'lKj and Jd' X. ? subject to 

a' a - b' b = l is given by A(k). Using the representation (12), the population 

first canonical correlation is given by p = max p(cos (0 ,-q), cos( &,-£)) = A(k) 

A o 1 C 

P 
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We now determine the rotations a and (3 for which the maximum is 
attained. First of all, Cov[cos(6j-n), cos (0 2 ~p) ] = cos( a-p-p o ) , 

and Var [cos (0^-a)] = ^ar [cos (G 2 -p)] = j. Thus, p[cos(6j-a), cos(0 2 *P)] 

= A(«) cos(a-p-p o ) which is maximized for cos(<v-f3-|i 0 ) =1 or ii-p-|i Q = 0 (moa 2n). 
Hence, a-p = p Q ( mod 2 it) maximizes p, Although a, p are not unique mod 2-rr, 
one can be chosen arbitrarily and then the other will be fixed. This shows, 
in addition, that max p[cos(9,-a), cos 0 2 J = max p[cos(0^-o), c-os(0 2 ”P)l 

a a, p 

s max p[cos cos ( 0_-|3>] for this model. We thus need only rotate one of 

P 1 

the two angles to find the maximum correlation. 

Next, consider estimating A(«) using a random sample (0^ j), 

i = 1, . . . , n, from this distribution. The maximum likelihood estimate of A(k) 


' ' - - 1-2 —2 

(c. f. Mardia , p. 122) is given by A(x) = R where R = V C + S , 

_ , n - 1 n 

C = *r 2 cos (0. . - 0, and S = *• 2 sin (0. .- 0, ,). It is easy to 
n . , l, i 2, i ’ n , , i, i 2 , i 


show that R = — 2 cos (0, - 0_ . - x ) where x is the solution of 

n i=l 1, i 2, i o o 

C = R cos x , S = R sin x . That is, to estimate A(k) we rotate Q , — 0 by 

o’ o’ i c 

x q , the maximum likelihood estimate of and find 

, n _ _ 

C = - 2 (cos (0, - 0 - x ) . The use ot R fortesting for independence 

n ^ | l 1 o 

is discussed and tables given by Mardia (1972), page 136 and page 300. 

The above method coincides exactly with that suggested by Stevens (197 3) 

n n 

who proposed the sample measure r = max 2 (HX,)' Y. = max 2 cos 0. 

+ H rotations i=l H i=l 

where 0 i is the angle between HX . and Y,. when *«• 9 are unit vectors . To 
see this, we note that multiplication of jC. by a rotation matrix H corresponds to 
subtracting a constant a from 9^ . where ^ . = (cos 0^ . sin ’ , and 
n n 


max 

H rotation 


i=l 


max 


2 cos(0. . 
i=l 


0 . - a). Differentiating 

C.) 1 


a 
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_ i ' A > 

and setting equal to zero gives the maximum at o = x q , so — r + = A(k) for 
this model. 

Next we consider a more general model, still having uniform marginals, 
with density of the form 

g(0 l*°2> = h h(G r 0 2"^o ) ' °-°l’ 0 2 < 27r ’ 


where p is a parameter and h(* ) is a circular density; i. e. , h(x) > 0 and 

2tt ° „i 

I h(x) dx = 1. If we let h(x) = (2n I (k) ) exp [« cos x] , o < x < 2 tt , 

*• U 

o 

this model reduces to (20), 

2tt * 2tt 

Let A = I cos x h(x) dx, B - J sin x h(x) dx, and define jK by 
b o 

(9) and S by (10). The covariance matrix 2 becomes 


/ i 

/ 2 

0 

\ 

i ] 

j -|(A cos (i - B sin |x) 

- -^(A sin ix + B cos jx)^ 

0 

1 

2 

’ ^fA sin p + B cos p) 

-|(A cos |x - B sin (x) 


0 

““i 

! 1 

j 2 

i 

0 

\ 


i 

i 0 

t 

1 

2 


2 2 2 “ 

And equation (11), for the canonical correlations, reduces to ( K - (A + B ) ) =0. 

Hence, the maximum correlation for a,' X,^, Jd 1 X, 2 subject to a} = ,b’ ^22 ~ = ^ 

or equivalently subject to g ^ £ = l is p a = ’J A 2 + B 2 . Next represent 

^ and b_, by (12) . Then 

PUl'v&p p[cos(0j- a-), cos(0 2 -p)] = A cos(o-e-(i) + B sin (fl-P-p). 

Comparing this with v + B , we obtain tan (u-P-|j.) = — . Again a single 
rotation of either 6^ or 0 2 with the other assuming an arbitrary value is sufficient 
to obtain the maximum correlation. 

Remark It seems that the single rotation definition is closely tied to 


uniform marginals. 
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i 


7. 2 The_Wrapped Bivariate Normal Distributio n 

Let (0j, 0 2 ) be a random vector with the wrapped bivariate normal 
distribution. (See the appendix for a description of this distribution. ) Let 
&2’ and S be defined b y (9) and (10) respectively. By using the 
trigonometric moments (A. 10) and some elementary trigonometric identities, 
we obtain the covariance matrix 



-or. 


/ 

/ 1 “ l 

j — a (l-e cos 2y.j) 


11 ! , -rf sin 2 

' V 2 3 6 


1 "*1 


J ae sin 2p.^ 


1 1 
~a (1+e cos 2(4^) 


/. -icri 


f / 

l ^ — I 

I ^12 


/ 1 “ 2* <r f +0 ’2* 

2 e c os(|j. 1 +fi 2 ) + c cos(|x 1 -(j- 2 )) 


V 2 e 




(b sin(tx 1 +ji 2 )+ c sin(|x 1 -(jL 2 )) 


12 2 \ 

1 ’T'l+'P 

2 e (b sin(|j.j+n 2 )- c sin([j.j-fi )) \ 


l /fr 2. 2. 
I ~Z { l 2* 


2 e 


(-b cos( (j.|+}i 2 )+ c cosl^-fj^)) 


l Jfl _ _(r 2 


j ~2 d(l-e cos 2 [x 2 ) 


'22 


1. ."“z 


. “Td e sin 2\i. 
\ 4 < 


L rt ~ a 2 . _ 

2 d e sin 2 m- 2 


1 _Cr 2 
2 d (1+e cos 2 (jl 2 ) 
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-cr 


-pa tr 


where a = l-e 1 , b = e * 2 -L, c = e 1 ‘‘-l, d = 1 - e 


P * fz 


-<r. 


( 21 ) 


After some algebra, the equation given by (11) reduces to 
7 2 2 2 2 2 

2 -of -<r / “(o-f+o- ) 2 

-e 1 ) (l-e ) - e Z (cosh pcr^-l) ] x 


.2 -2°f -2c r 2 -(°f+<r 2 ) 

[~-(l-e 1 )(l-e )-e 1 * sinh poy 2 ] = 0 


whose solutions are 


X l = 


cosh per ^tr -1 


x z ■ 


sinh P tr | tr 2 l 


2 sinh 


sinh 


_ 2 _ 

2 


>7 2 

■s/ sinh cr^ sinh cr^ 


The correlation X^ corresponds to rotating 0^ by p^ and 0^ by p 2 , that is 
rotating both variables until they are centered at their means. The correlation 
X^ corresponds to rotating 0^ by 0^ by p^ + r~ and 0^ by p 2 if p is 
positive and to rotating 0^ by p^ - j and 0^ by p 2 ~ -■ if p is negative. 
Thus, X^ = p[cos(0 1 - p l ), cos(0 2 -p 2 )j and, if p * 0 , 

X 2 = p[co S (0 1 -p 1 -M- f), cos(0 2 -p 2 -M- f )]. 

We now give a proposition concerning these correlations and discuss 
some of their properties. 

Proposition 7. 1 

If *1 = °> then K 2 > X 1 lf P* 1; X 2 = if P = 1 or if p = 0 . 



- 22 - 


Proof 

Assume p * 0. 


/ 2 ,i 2 

£ 

(cosh po- 2 -!) 2 

sinh 2 m 2 

(cosh pm 2 -l) 2 ye* - e } 


2 2 
sinh pm 

4 sinh** — 

1 - cosh p«r <r <r d 

[ 2 *2 

\e - e ^ 


cosh pm 2 -l 


cosh ptr 2 -l (e 2<r + Ze* + ij 


2 

cosh pm -fl 

ii 

—* 

i 

fM 

b 

cosh pcr 2 ..^ ^2m 2 . 2< / 2 l f 


(cosh pm 2 -l) 

2 

(cosh o- + 1) 

2 2 2 2. 
cosh m cosh pm -1 + (cosh per -cosh tr ) 


2 

(cosh pm +1) 

(cosh m 2 - 1) 

2 2 2 2 
cosh (r cosh ptr -1 *f (cosh v -cosh p* ) 


2 2 1 

If p < l, then cosh pm < cosh m and — =— < 1 ? if p= 1. 

X 2 


cosh 



cosh <r and 



1 . 


If p = 0, X.j = - o. 


Proposition 7. I states that if the variances are equal for both marginal 
distributions, then p a = . Moreover, a numerical study indicates that 

\ > for all p when and are relatively close in value, and > \ 2 

when one variance is very much larger than the other. In other words, in the 
usual cases where the marginal variances not too different, 

P A = = p[cos( Gj- Hj- ~ ), cos(0 2 -(x 2 - j) ] = pfsinfGpij), sinf0 2 -p 2 ) J. 

It can be shown that < 1 and X 2 £ 1 with = V 2 = 1 if and 

only if p = 1 and <r^ = o* 2 ■ Thus, cos (G^- a) and cos (0 2 “ p) can be perfectly 
correlated only if the underlying bivariate normal distribution is perfectly correlated 
ard has equal variances. In this case, = 6 2 + 6 ^ or some which agrees 
with an earlier lemma. 


For general a and (5, the correlation between cos(0j“i<) and 
cos(6 2 ~p) is given by 

L ,2 , 2 . 

~ ? *^1 ? ) 

e [b cos( (Jt.-o+^L -p) + c cos((j..-a-(n 2 ’P))] 

p[cos(0.~ , a), cos( 0_-p) ] = r ■■■ " ' — 

1 4 / 2 2 

/ - tr « -°2 
V a(l-e cos 2{pj-u)) d(l-e cos 2( ^ 2 - P)) 

where a, b, c, d are defined by (21). Consequently, p a depends cn both a and 
p t and the two rotations are necessary to maximize the correlation. 
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Appendix 

A. 1. Wrapping on the Torus 

A method of forming a circular distribution from a distribution on the 
line is to wrap the distribution on the unit circle. That 

is, if X is a random variable with c. d. f. Fix) and c. f. 4>(t), then a circulai 
variable is determined from 0 = X (mod 2-n). With this transformation. G has 
c. d. f. 

oo 

F r ( 0) = £ (F(9+ 2irk) - F(2Trk)), 0 £G< 2 tt , 

k=-« 

and characteristic function <}> = <j>(p) (c. f. Mardia (1972), p. 53). 

Analogously, one can term a bivariate circular distribution by wrapping 
a bivariate distribution, defined on the plane, on the unit torus. Let (Xj, X 2 ) 
be a bivariate random vector from a distribution having c. d. f. F(£j,£ 2 ), density 
f(£^,£ 2 ), and characteristic function 4Kt^,t 2 ). Let 

0j = Xj(mod 2ir) , 0 2 = X 2 (mod 2n) . (A. 1) 

Then 0^ and 0 2 are random variables on the circle, and (0^, 0 2 ) is a random 
vector on the torus. The c. d. f. of (0^, 0 2 ) is 

00 00 

F G (0 1* 9 2 )=2 2 [F (9, + 2irj,0 +Z7rk)-F(2irj, 0 +2nk)-F(0.+2iTj, 2irk)+F(2r), 2Trk)]. 

~ 1 j =-oo k=-oo i c c i 

0 < 0^, © 2 < 2tt; and the density is given by 
00 00 

f 0 ( 0 l, e 2 ) ~ V S f (Oj+Z^rj, 0 2 +2irk), 0 < Oj. 0 2 < 2 tt . 
j = — 00 k = -00 
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Lemma A 1 The c. f. of (0,, 0 ) is 6 = <Mp,q) where p and q are 

integers. 


♦ft q * «* 


ip0,4-iqo 2tt 2tt ipG.+ iqO 

2 ) = J J e 1 2 

o ‘o 


f Q (Oj, 0 2 > d0j d© 2 


= I 2 ”, 2 ’ tp0,+ iq0 2 » 

^ • G w 


00 00 

V S' 


j=-oe k=-* 


£(0^2*), 0.,+2trk) do. 


00 

00 

2ir 2ir 

2 

j=-oe 

s 

k=-oo 

J J 

o o 

00 

00 

2ir(j+l) 

S 

2 

J 

J=“°° 

k=-00 

2tt] 

00 

1 1 

<*> ip0^+iq0 2 

e 


J e 

2 irk 


f ( 0^+2trj , 0 2 +2irk) d0^ d© 2 


f ( 0 .. 0 _)d 0 . de., 

1 c. L 


-00 -00 


f(e i> 0 2^ dG l d °2 ~ ^ P ’ 


Theorem A2. 

<|> determined at integer values of p and q is sufficient to determine 

P* 4 

the distribution of j), . 

Proof 

Define the family of functions 


u ( I, tj) = S 2 <j> 

p 2 p a 

^ 4-rt p=-oo q=-co t f 4 


00 00 |l,l| . J. . 

S 2 <j> p iP + q ' e ~ ip ^ nqT \ 0< p < 1. 

1 n rr 1 ’ — 1 


If cj> = l, (A. 2) reduces to 

P, H 


00 00 
l V* v 

2 * "1 p 

4ir p=— or q=-oo 


!pl+!q| g-ipe-i^n = c{ ? , ;P ) C(| ;P ), o< P <\ 


n**-* 
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where C(t];p) =~ S p^ p ^e** ipT1 is the p. d. f. of the wrapped Cauchy 

P=-00 


Zit 2 tt ipe^iqQ^, 

distribution (or is the Poisson kernel). Substituting J J e 'dF(0., 6-) = 

o o 1 


4> ,, into (A. 2), v/e obtain 
P, <3 ’ 


u p {!,n)=-4r s 2 j 2lT j 27r e i(pe l +qe 2 ) dF( o i> e 2 ) p lpl +iq f e' ip| " iqn 


4tr p=-oo q=-oo o o 


Ztt 2it , 


L, » Lli . | 

= .' i - 1 

o o \ 4 tt 


oo oo 

S S p 


4tt p=-oo q=-oo 




2tt 2tt 

/ J C(4-e l ;p)C(r r e 2 ;p) dF(o p e 2 ) 


(A, 4) 


o o 


The second equality follows by the dominated convergence theorem, and the 
last equality holds by applying (A3). 

Using (A. 4) and applying Fubini's Theorem, we obtain 


“l °2 


°1 a Z 2 tt Zn 


lim f j u (£, g) d £ dr| = Urn f J J J C(£-e,;p)C(r|-e ;p)dF(6., 0 )d£ dg 
p o o ^ p“*l o o o o 

2tt 2tt a Z (A* 5 ) 

m J J J C(£-e;p)d£ J C( n - 0 2 ;p)dn dF(e.,e ). 

►1 o o o 1 o c 


= lim 
P* 


A lemma concerning the Poisson kernel (c. f. Feller (1971), p. 627) states that 


lim j C(|-0 5 p) d£ = I, , 
p-H o (o ’ aj 


(e) 


(A. 6) 


The dominated convergence theorem and (A, 6) give 
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Q 1 

- F( o j, Q 2)* 

Hence, F(«j, a^) can be obtained by a limiting process from the u p (£, n) 
which depend only on <J> p ^ . 


lim / 
P-1 o 


h) 


2tt 2 it 

d i dn = f J 

o o 


I (o, < , 1 ]<V I ( o > » 2 J« 6 2 ' dP < e l' e 2 > 


A. 2 . The Wrapped Bivariate Normal Distribution 


Let (X^X^.) be bivariate normal with mean vector £ 


matrix E where 
2 = 


/ 


PT 2 




and covariance 


(A. 7) 


Define £ by (A. 1) so, according to Lemma A 1 , the characteristic function of 
0 is 


4*p q = exp {-|(p 2 <r^ + 2pq P*f 2 + + qP- 2 ) } 


(A. 8) 


Since 


♦m - 06 


ip0j+ iq 0^ 


) = E cos(p0 1 + q0 2 ) + i E sin(p Oj+q 0 2 ) (A. 9) 


for any random vector ,0, , equating (A. 9) and (A. 8), we obtain 

1 2 2 2 2 

E cos (p0 1 +q0 2 ) = cos(pnj+qK 2 ) exp {- -j { p o-^ + 2pq P°' 1 °‘ 2 + q ^ ) } 
E sin (p0 1 +q9 2 ) = sin(ppj +qp 2 ) exp { (p 2 ir 2 + 2pq pcr^ + q 2 <r 2 ) } 


(A. 10) 



- 28 - 


A. 3 A Model for (0,X) 

One model for (0, X) can be formed from a bivariate normal distribution. 
Let (Y,,Y_) have mean vector p. and covariance matrix 2 defined by (A. 7). 
Let 0 = Yj(mod 2 tt), X = Y^. Then (0,X) has a distribution with the desired 
support and having characteristic function 

4>( P, t) = exp{-~(p 2 (rf + 2p tp + q 2 <r 2 ) + i (p Hj+t^) } . (A. 11) 

The proofs that this is the desired c. f. and that it is necessary to determine 
<j>(p, t) only for integer p and real t are entirely analogous to those presented 
in section A, 1. The moments E(X cos 0) and E(X sin 6) can be easily 
determined from the c. f. 

Lemma A 3 If E|x| exists and is finite, then 

= - E(X sin 0) + i E(X cos 0). 

t =o 
P=1 

Proof The c. f. Is «p, t) = E(e lpe * it5! ). Then 

fr-ir /V e^ + ^dF,e,x) 

o -°o 

- ff -t e ‘ pe+ltx 

o -oo 

= J' f ix e ip0 + ltx dF( 0 , x) . 

O -00 

The interchange of differentiation and integration is justified by the fact 
E|X| < oo and the dominated convergence theorem. 


at 
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If (0. X) has c. f. given by (A. 11), 


M £J0. 

at 


t=o 

p=i 



[{-p <rf 2 cos |ij - p 2 sin (x^) + i (p 2 cos sin p^) ] . 


Hence, the moments are given by : 



2 


E (X cos 9) - e (p 2 cos Pj - P*f z sin pj) 

.iL 

E(X sin 6) = e 2 (p 2 sin + P cr f 2 cos h-j_) • 


(A. 12) 


We note that the marginals of this distribution are the wrapped normal 
with parameters pj and o' 2 and the normal distribution with parameters p 2 
and cr 2 for 9 and X respectively. This can be seen by considering t = 0 
and p = 0 in the c. f. 4>( P, t). 
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